Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 6966 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 836.9 KiB |
| Average record size in memory | 123.0 B |
Variable types
| Categorical | 6 |
|---|---|
| Numeric | 9 |
| Boolean | 3 |
date has a high cardinality: 364 distinct values | High cardinality |
month is highly correlated with year | High correlation |
year is highly correlated with month | High correlation |
dayofweek_n is highly correlated with working_day | High correlation |
working_day is highly correlated with dayofweek_n | High correlation |
month is highly correlated with year | High correlation |
year is highly correlated with month | High correlation |
dayofweek_n is highly correlated with working_day | High correlation |
working_day is highly correlated with dayofweek_n | High correlation |
hour is highly correlated with rain and 4 other fields | High correlation |
rain is highly correlated with hour and 4 other fields | High correlation |
temp is highly correlated with year and 3 other fields | High correlation |
rhum is highly correlated with year and 3 other fields | High correlation |
wdsp is highly correlated with year and 3 other fields | High correlation |
day is highly correlated with year and 3 other fields | High correlation |
month is highly correlated with year and 3 other fields | High correlation |
year is highly correlated with hour and 9 other fields | High correlation |
holiday is highly correlated with hour and 9 other fields | High correlation |
dayofweek_n is highly correlated with working_day and 1 other fields | High correlation |
working_day is highly correlated with hour and 9 other fields | High correlation |
peak is highly correlated with hour and 9 other fields | High correlation |
working_day is highly correlated with dayofweek | High correlation |
year is highly correlated with season | High correlation |
season is highly correlated with year | High correlation |
dayofweek is highly correlated with working_day | High correlation |
hour is highly correlated with peak and 1 other fields | High correlation |
rain is highly correlated with rain_type | High correlation |
temp is highly correlated with month and 2 other fields | High correlation |
month is highly correlated with temp and 2 other fields | High correlation |
year is highly correlated with temp and 2 other fields | High correlation |
dayofweek_n is highly correlated with dayofweek and 1 other fields | High correlation |
dayofweek is highly correlated with dayofweek_n and 1 other fields | High correlation |
working_day is highly correlated with dayofweek_n and 2 other fields | High correlation |
season is highly correlated with temp and 2 other fields | High correlation |
peak is highly correlated with hour and 1 other fields | High correlation |
timesofday is highly correlated with hour | High correlation |
rain_type is highly correlated with rain | High correlation |
date is uniformly distributed | Uniform |
hour has 266 (3.8%) zeros | Zeros |
rain has 6325 (90.8%) zeros | Zeros |
dayofweek_n has 962 (13.8%) zeros | Zeros |
Reproduction
| Analysis started | 2022-04-15 22:03:35.844625 |
|---|---|
| Analysis finished | 2022-04-15 22:03:49.198078 |
| Duration | 13.35 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 364 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 54.5 KiB |
| 2021-12-15 | 24 |
|---|---|
| 2021-07-24 | 24 |
| 2022-02-05 | 24 |
| 2022-02-04 | 24 |
| 2021-06-27 | 23 |
| Other values (359) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021-03-01 |
|---|---|
| 2nd row | 2021-03-01 |
| 3rd row | 2021-03-01 |
| 4th row | 2021-03-01 |
| 5th row | 2021-03-01 |
Common Values
| Value | Count | Frequency (%) |
| 2021-12-15 | 24 | 0.3% |
| 2021-07-24 | 24 | 0.3% |
| 2022-02-05 | 24 | 0.3% |
| 2022-02-04 | 24 | 0.3% |
| 2021-06-27 | 23 | 0.3% |
| 2021-09-04 | 23 | 0.3% |
| 2021-08-01 | 23 | 0.3% |
| 2022-02-16 | 23 | 0.3% |
| 2022-02-11 | 23 | 0.3% |
| 2021-09-11 | 23 | 0.3% |
| Other values (354) | 6732 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 2021-12-15 | 24 | 0.3% |
| 2022-02-04 | 24 | 0.3% |
| 2021-07-24 | 24 | 0.3% |
| 2022-02-05 | 24 | 0.3% |
| 2021-09-04 | 23 | 0.3% |
| 2021-08-01 | 23 | 0.3% |
| 2022-02-16 | 23 | 0.3% |
| 2022-02-11 | 23 | 0.3% |
| 2021-09-11 | 23 | 0.3% |
| 2022-01-27 | 23 | 0.3% |
| Other values (354) | 6732 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.82931381 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 266 |
| Zeros (%) | 3.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 8 |
| median | 13 |
| Q3 | 18 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 6.3472844 |
|---|---|
| Coefficient of variation (CV) | 0.4947485496 |
| Kurtosis | -0.8357483945 |
| Mean | 12.82931381 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.2775842676 |
| Sum | 89369 |
| Variance | 40.28801926 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=24)
| Value | Count | Frequency (%) |
| 17 | 360 | 5.2% |
| 18 | 358 | 5.1% |
| 14 | 358 | 5.1% |
| 13 | 356 | 5.1% |
| 11 | 355 | 5.1% |
| 15 | 354 | 5.1% |
| 16 | 352 | 5.1% |
| 12 | 352 | 5.1% |
| 9 | 352 | 5.1% |
| 10 | 349 | 5.0% |
| Other values (14) | 3420 |
| Value | Count | Frequency (%) |
| 0 | 266 | |
| 1 | 168 | |
| 2 | 149 | |
| 3 | 130 | 1.9% |
| 4 | 117 | 1.7% |
| 5 | 134 | 1.9% |
| 6 | 222 | |
| 7 | 293 | |
| 8 | 348 | |
| 9 | 352 |
| Value | Count | Frequency (%) |
| 23 | 278 | |
| 22 | 306 | |
| 21 | 317 | |
| 20 | 344 | |
| 19 | 348 | |
| 18 | 358 | |
| 17 | 360 | |
| 16 | 352 | |
| 15 | 354 | |
| 14 | 358 |
| Distinct | 44 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05960378984 |
| Minimum | 0 |
|---|---|
| Maximum | 10.3 |
| Zeros | 6325 |
| Zeros (%) | 90.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.3 |
| Maximum | 10.3 |
| Range | 10.3 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3295045132 |
|---|---|
| Coefficient of variation (CV) | 5.528247685 |
| Kurtosis | 209.3943547 |
| Mean | 0.05960378984 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.40684525 |
| Sum | 415.2 |
| Variance | 0.1085732242 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=44)
| Value | Count | Frequency (%) |
| 0 | 6325 | |
| 0.1 | 194 | 2.8% |
| 0.2 | 86 | 1.2% |
| 0.3 | 56 | 0.8% |
| 0.4 | 44 | 0.6% |
| 0.6 | 38 | 0.5% |
| 0.5 | 28 | 0.4% |
| 0.7 | 26 | 0.4% |
| 0.8 | 21 | 0.3% |
| 0.9 | 19 | 0.3% |
| Other values (34) | 129 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 6325 | |
| 0.1 | 194 | 2.8% |
| 0.2 | 86 | 1.2% |
| 0.3 | 56 | 0.8% |
| 0.4 | 44 | 0.6% |
| 0.5 | 28 | 0.4% |
| 0.6 | 38 | 0.5% |
| 0.7 | 26 | 0.4% |
| 0.8 | 21 | 0.3% |
| 0.9 | 19 | 0.3% |
| Value | Count | Frequency (%) |
| 10.3 | 1 | |
| 5.5 | 1 | |
| 5.2 | 1 | |
| 5.1 | 1 | |
| 4.9 | 1 | |
| 4.7 | 1 | |
| 4.6 | 1 | |
| 4.5 | 1 | |
| 4.2 | 1 | |
| 3.6 | 1 |
| Distinct | 284 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.74239162 |
| Minimum | -4 |
|---|---|
| Maximum | 26.3 |
| Zeros | 7 |
| Zeros (%) | 0.1% |
| Negative | 54 |
| Negative (%) | 0.8% |
| Memory size | 54.5 KiB |
Quantile statistics
| Minimum | -4 |
|---|---|
| 5-th percentile | 2.6 |
| Q1 | 7.025 |
| median | 10.6 |
| Q3 | 14.5 |
| 95-th percentile | 18.775 |
| Maximum | 26.3 |
| Range | 30.3 |
| Interquartile range (IQR) | 7.475 |
Descriptive statistics
| Standard deviation | 5.002159358 |
|---|---|
| Coefficient of variation (CV) | 0.4656467141 |
| Kurtosis | -0.4056822749 |
| Mean | 10.74239162 |
| Median Absolute Deviation (MAD) | 3.7 |
| Skewness | 0.09826056311 |
| Sum | 74831.5 |
| Variance | 25.02159824 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 10.1 | 69 | 1.0% |
| 8 | 66 | 0.9% |
| 10.7 | 64 | 0.9% |
| 10.6 | 64 | 0.9% |
| 13.2 | 64 | 0.9% |
| 7.6 | 63 | 0.9% |
| 8.9 | 63 | 0.9% |
| 8.7 | 62 | 0.9% |
| 9.3 | 62 | 0.9% |
| 11.3 | 60 | 0.9% |
| Other values (274) | 6329 |
| Value | Count | Frequency (%) |
| -4 | 1 | < 0.1% |
| -3.4 | 1 | < 0.1% |
| -3.3 | 1 | < 0.1% |
| -2.9 | 3 | |
| -2.8 | 1 | < 0.1% |
| -2.6 | 1 | < 0.1% |
| -2.5 | 1 | < 0.1% |
| -2.3 | 1 | < 0.1% |
| -2.1 | 1 | < 0.1% |
| -2 | 2 |
| Value | Count | Frequency (%) |
| 26.3 | 3 | |
| 26.2 | 1 | < 0.1% |
| 25.9 | 1 | < 0.1% |
| 25.7 | 2 | |
| 25.6 | 1 | < 0.1% |
| 25.4 | 3 | |
| 25.3 | 2 | |
| 25.2 | 1 | < 0.1% |
| 25.1 | 2 | |
| 25 | 1 | < 0.1% |
| Distinct | 69 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 80.54593741 |
| Minimum | 24 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.5 KiB |
Quantile statistics
| Minimum | 24 |
|---|---|
| 5-th percentile | 58 |
| Q1 | 73 |
| median | 82 |
| Q3 | 90 |
| 95-th percentile | 97 |
| Maximum | 100 |
| Range | 76 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 11.91872934 |
|---|---|
| Coefficient of variation (CV) | 0.1479743079 |
| Kurtosis | 0.2126581319 |
| Mean | 80.54593741 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.7111188468 |
| Sum | 561083 |
| Variance | 142.0561091 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 87 | 256 | 3.7% |
| 88 | 255 | 3.7% |
| 82 | 251 | 3.6% |
| 84 | 238 | 3.4% |
| 89 | 232 | 3.3% |
| 79 | 230 | 3.3% |
| 85 | 223 | 3.2% |
| 86 | 222 | 3.2% |
| 83 | 222 | 3.2% |
| 91 | 222 | 3.2% |
| Other values (59) | 4615 |
| Value | Count | Frequency (%) |
| 24 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 39 | 2 | |
| 40 | 4 | |
| 41 | 3 |
| Value | Count | Frequency (%) |
| 100 | 100 | |
| 99 | 63 | 0.9% |
| 98 | 88 | 1.3% |
| 97 | 136 | |
| 96 | 148 | |
| 95 | 200 | |
| 94 | 188 | |
| 93 | 208 | |
| 92 | 200 | |
| 91 | 222 |
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.811369509 |
| Minimum | 1 |
|---|---|
| Maximum | 35 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 6 |
| median | 8 |
| Q3 | 11 |
| 95-th percentile | 17 |
| Maximum | 35 |
| Range | 34 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.38365003 |
|---|---|
| Coefficient of variation (CV) | 0.4974992849 |
| Kurtosis | 1.645777081 |
| Mean | 8.811369509 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.002281537 |
| Sum | 61380 |
| Variance | 19.21638759 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=33)
| Value | Count | Frequency (%) |
| 7 | 722 | |
| 6 | 702 | |
| 8 | 660 | |
| 5 | 618 | |
| 9 | 573 | 8.2% |
| 10 | 560 | 8.0% |
| 4 | 508 | 7.3% |
| 11 | 464 | 6.7% |
| 12 | 346 | 5.0% |
| 3 | 340 | 4.9% |
| Other values (23) | 1473 |
| Value | Count | Frequency (%) |
| 1 | 31 | 0.4% |
| 2 | 163 | 2.3% |
| 3 | 340 | |
| 4 | 508 | |
| 5 | 618 | |
| 6 | 702 | |
| 7 | 722 | |
| 8 | 660 | |
| 9 | 573 | |
| 10 | 560 |
| Value | Count | Frequency (%) |
| 35 | 2 | < 0.1% |
| 33 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 5 | |
| 29 | 4 | |
| 28 | 4 | |
| 27 | 4 | |
| 26 | 3 | < 0.1% |
| 25 | 6 | |
| 24 | 9 |
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.64053976 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.689671829 |
|---|---|
| Coefficient of variation (CV) | 0.5555864414 |
| Kurtosis | -1.177659354 |
| Mean | 15.64053976 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.004253637489 |
| Sum | 108952 |
| Variance | 75.51039649 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=31)
| Value | Count | Frequency (%) |
| 24 | 240 | 3.4% |
| 18 | 239 | 3.4% |
| 11 | 236 | 3.4% |
| 22 | 236 | 3.4% |
| 12 | 236 | 3.4% |
| 13 | 235 | 3.4% |
| 23 | 235 | 3.4% |
| 15 | 235 | 3.4% |
| 17 | 234 | 3.4% |
| 16 | 234 | 3.4% |
| Other values (21) | 4606 |
| Value | Count | Frequency (%) |
| 1 | 224 | |
| 2 | 225 | |
| 3 | 225 | |
| 4 | 232 | |
| 5 | 231 | |
| 6 | 225 | |
| 7 | 234 | |
| 8 | 225 | |
| 9 | 232 | |
| 10 | 222 |
| Value | Count | Frequency (%) |
| 31 | 106 | |
| 30 | 192 | |
| 29 | 210 | |
| 28 | 209 | |
| 27 | 233 | |
| 26 | 230 | |
| 25 | 227 | |
| 24 | 240 | |
| 23 | 235 | |
| 22 | 236 |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.557565317 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.437607482 |
|---|---|
| Coefficient of variation (CV) | 0.5242200901 |
| Kurtosis | -1.196971805 |
| Mean | 6.557565317 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.04555345272 |
| Sum | 45680 |
| Variance | 11.8171452 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=12)
| Value | Count | Frequency (%) |
| 8 | 618 | |
| 10 | 617 | |
| 7 | 600 | |
| 6 | 599 | |
| 1 | 599 | |
| 9 | 585 | |
| 11 | 583 | |
| 5 | 566 | |
| 12 | 564 | |
| 3 | 555 | |
| Other values (2) | 1080 |
| Value | Count | Frequency (%) |
| 1 | 599 | |
| 2 | 544 | |
| 3 | 555 | |
| 4 | 536 | |
| 5 | 566 | |
| 6 | 599 | |
| 7 | 600 | |
| 8 | 618 | |
| 9 | 585 | |
| 10 | 617 |
| Value | Count | Frequency (%) |
| 12 | 564 | |
| 11 | 583 | |
| 10 | 617 | |
| 9 | 585 | |
| 8 | 618 | |
| 7 | 600 | |
| 6 | 599 | |
| 5 | 566 | |
| 4 | 536 | |
| 3 | 555 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 54.5 KiB |
| 2021 | |
|---|---|
| 2022 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021 |
|---|---|
| 2nd row | 2021 |
| 3rd row | 2021 |
| 4th row | 2021 |
| 5th row | 2021 |
Common Values
| Value | Count | Frequency (%) |
| 2021 | 5823 | |
| 2022 | 1143 | 16.4% |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 2021 | 5823 | |
| 2022 | 1143 | 16.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 KiB |
| False | |
|---|---|
| True | 133 |
| Value | Count | Frequency (%) |
| False | 6833 | |
| True | 133 | 1.9% |
dayofweek_n
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.031007752 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 962 |
| Zeros (%) | 13.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.989971351 |
|---|---|
| Coefficient of variation (CV) | 0.6565378625 |
| Kurtosis | -1.241328444 |
| Mean | 3.031007752 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.02522150698 |
| Sum | 21114 |
| Variance | 3.959985976 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 5 | 1031 | |
| 4 | 1021 | |
| 2 | 997 | |
| 3 | 993 | |
| 6 | 988 | |
| 1 | 974 | |
| 0 | 962 |
| Value | Count | Frequency (%) |
| 0 | 962 | |
| 1 | 974 | |
| 2 | 997 | |
| 3 | 993 | |
| 4 | 1021 | |
| 5 | 1031 | |
| 6 | 988 |
| Value | Count | Frequency (%) |
| 6 | 988 | |
| 5 | 1031 | |
| 4 | 1021 | |
| 3 | 993 | |
| 2 | 997 | |
| 1 | 974 | |
| 0 | 962 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 54.5 KiB |
| Saturday | |
|---|---|
| Friday | |
| Wednesday | |
| Thursday | |
| Sunday | |
| Other values (2) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.150301464 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Monday |
|---|---|
| 2nd row | Monday |
| 3rd row | Monday |
| 4th row | Monday |
| 5th row | Monday |
Common Values
| Value | Count | Frequency (%) |
| Saturday | 1031 | |
| Friday | 1021 | |
| Wednesday | 997 | |
| Thursday | 993 | |
| Sunday | 988 | |
| Tuesday | 974 | |
| Monday | 962 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| saturday | 1031 | |
| friday | 1021 | |
| wednesday | 997 | |
| thursday | 993 | |
| sunday | 988 | |
| tuesday | 974 | |
| monday | 962 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 4833 | |
| False | 2133 |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 54.5 KiB |
| Summer | |
|---|---|
| Autumn | |
| Spring | |
| Winter |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Winter |
|---|---|
| 2nd row | Winter |
| 3rd row | Winter |
| 4th row | Winter |
| 5th row | Winter |
Common Values
| Value | Count | Frequency (%) |
| Summer | 1847 | |
| Autumn | 1741 | |
| Spring | 1704 | |
| Winter | 1674 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| summer | 1847 | |
| autumn | 1741 | |
| spring | 1704 | |
| winter | 1674 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.9 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 4585 | |
| True | 2381 |
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 54.5 KiB |
| Afternoon | |
|---|---|
| Morning | |
| Evening | |
| Night |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.191788688 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Night |
|---|---|
| 2nd row | Morning |
| 3rd row | Morning |
| 4th row | Morning |
| 5th row | Morning |
Common Values
| Value | Count | Frequency (%) |
| Afternoon | 2132 | |
| Morning | 1697 | |
| Evening | 1673 | |
| Night | 1464 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| afternoon | 2132 | |
| morning | 1697 | |
| evening | 1673 | |
| night | 1464 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 54.5 KiB |
| no rain | |
|---|---|
| drizzle | 336 |
| moderate rain | 224 |
| light rain | 72 |
| heavy rain | 9 |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 7.227820844 |
| Min length | 7 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | no rain |
|---|---|
| 2nd row | no rain |
| 3rd row | no rain |
| 4th row | no rain |
| 5th row | no rain |
Common Values
| Value | Count | Frequency (%) |
| no rain | 6325 | |
| drizzle | 336 | 4.8% |
| moderate rain | 224 | 3.2% |
| light rain | 72 | 1.0% |
| heavy rain | 9 | 0.1% |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| rain | 6630 | |
| no | 6325 | |
| drizzle | 336 | 2.5% |
| moderate | 224 | 1.6% |
| light | 72 | 0.5% |
| heavy | 9 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
count
Real number (ℝ≥0)
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.754378409 |
| Minimum | 1 |
|---|---|
| Maximum | 26 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 54.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 11 |
| Maximum | 26 |
| Range | 25 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.442080321 |
|---|---|
| Coefficient of variation (CV) | 0.7239811442 |
| Kurtosis | 1.566956878 |
| Mean | 4.754378409 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.177331745 |
| Sum | 33119 |
| Variance | 11.84791694 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=24)
| Value | Count | Frequency (%) |
| 1 | 1229 | |
| 2 | 1000 | |
| 3 | 898 | |
| 4 | 775 | |
| 5 | 645 | |
| 6 | 597 | |
| 7 | 471 | 6.8% |
| 8 | 368 | 5.3% |
| 9 | 274 | 3.9% |
| 10 | 208 | 3.0% |
| Other values (14) | 501 |
| Value | Count | Frequency (%) |
| 1 | 1229 | |
| 2 | 1000 | |
| 3 | 898 | |
| 4 | 775 | |
| 5 | 645 | |
| 6 | 597 | |
| 7 | 471 | 6.8% |
| 8 | 368 | 5.3% |
| 9 | 274 | 3.9% |
| 10 | 208 | 3.0% |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 24 | 2 | < 0.1% |
| 23 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 20 | 5 | 0.1% |
| 19 | 8 | 0.1% |
| 18 | 8 | 0.1% |
| 17 | 16 | |
| 16 | 19 | |
| 15 | 32 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| date | hour | rain | temp | rhum | wdsp | day | month | year | holiday | dayofweek_n | dayofweek | working_day | season | peak | timesofday | rain_type | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2021-03-01 | 2 | 0.0 | -1.2 | 98 | 4 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Night | no rain | 1 |
| 1 | 2021-03-01 | 7 | 0.0 | 2.1 | 100 | 4 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | True | Morning | no rain | 3 |
| 2 | 2021-03-01 | 8 | 0.0 | 5.1 | 98 | 5 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | True | Morning | no rain | 1 |
| 3 | 2021-03-01 | 9 | 0.0 | 5.7 | 98 | 5 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | True | Morning | no rain | 4 |
| 4 | 2021-03-01 | 10 | 0.0 | 6.7 | 94 | 6 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | True | Morning | no rain | 4 |
| 5 | 2021-03-01 | 11 | 0.0 | 7.4 | 91 | 8 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Morning | no rain | 4 |
| 6 | 2021-03-01 | 12 | 0.0 | 6.9 | 88 | 8 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Afternoon | no rain | 8 |
| 7 | 2021-03-01 | 13 | 0.0 | 9.3 | 84 | 8 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Afternoon | no rain | 11 |
| 8 | 2021-03-01 | 14 | 0.0 | 9.3 | 80 | 9 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Afternoon | no rain | 11 |
| 9 | 2021-03-01 | 15 | 0.0 | 8.3 | 79 | 11 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | True | Afternoon | no rain | 10 |
Last rows
| date | hour | rain | temp | rhum | wdsp | day | month | year | holiday | dayofweek_n | dayofweek | working_day | season | peak | timesofday | rain_type | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6956 | 2022-02-27 | 13 | 0.0 | 8.6 | 78 | 15 | 27 | 2 | 2022 | False | 6 | Sunday | False | Winter | False | Afternoon | no rain | 8 |
| 6957 | 2022-02-27 | 14 | 0.0 | 8.9 | 80 | 17 | 27 | 2 | 2022 | False | 6 | Sunday | False | Winter | False | Afternoon | no rain | 10 |
| 6958 | 2022-02-27 | 15 | 0.0 | 8.6 | 84 | 16 | 27 | 2 | 2022 | False | 6 | Sunday | False | Winter | False | Afternoon | no rain | 4 |
| 6959 | 2022-02-27 | 16 | 0.0 | 8.7 | 86 | 17 | 27 | 2 | 2022 | False | 6 | Sunday | False | Winter | False | Afternoon | no rain | 3 |
| 6960 | 2022-02-27 | 17 | 0.0 | 8.5 | 89 | 16 | 27 | 2 | 2022 | False | 6 | Sunday | False | Winter | False | Afternoon | no rain | 8 |
| 6961 | 2022-02-27 | 18 | 0.0 | 8.7 | 70 | 10 | 27 | 2 | 2022 | False | 6 | Sunday | False | Winter | False | Evening | no rain | 4 |
| 6962 | 2022-02-27 | 19 | 0.0 | 8.0 | 72 | 9 | 27 | 2 | 2022 | False | 6 | Sunday | False | Winter | False | Evening | no rain | 2 |
| 6963 | 2022-02-27 | 20 | 0.0 | 8.6 | 66 | 14 | 27 | 2 | 2022 | False | 6 | Sunday | False | Winter | False | Evening | no rain | 1 |
| 6964 | 2022-02-27 | 21 | 0.0 | 9.0 | 68 | 11 | 27 | 2 | 2022 | False | 6 | Sunday | False | Winter | False | Evening | no rain | 2 |
| 6965 | 2022-02-27 | 22 | 0.2 | 8.6 | 74 | 10 | 27 | 2 | 2022 | False | 6 | Sunday | False | Winter | False | Evening | drizzle | 2 |